Time Series Compressibility and Privacy
نویسندگان
چکیده
In this paper we study the trade-offs between time series compressibility and partial information hiding and their fundamental implications on how we should introduce uncertainty about individual values by perturbing them. More specifically, if the perturbation does not have the same compressibility properties as the original data, then it can be detected and filtered out, reducing uncertainty. Thus, by making the perturbation “similar” to the original data, we can both preserve the structure of the data better, while simultaneously making breaches harder. However, as data become more compressible, a fraction of the uncertainty can be removed if true values are leaked, revealing how they were perturbed. We formalize these notions, study the above trade-offs on real data and develop practical schemes which strike a good balance and can also be extended for on-the-fly data hiding in a streaming environment.
منابع مشابه
Contents 1 Privacy Preservation on Time Series 1
In this chapter, we discuss the problem of time series privacy preservation. A time series is a sequence of values that represent observations taken at constant time intervals. Time series data are prevalent in a wide range of domains and applications. However, data owners or publishers may not be willing to reveal the exact values of the time series due to privacy considerations. Thus, the dat...
متن کاملA New Piecewise EOS for Compressibility Factor Prediction Based on the M-factor Theory
In this study for the first time a new description of compressibility factor is rendered based on the virial expansion. The compressibility factor as a function of M-factor is qualitatively and quantitatively expressed. At first, we present how may the third, fourth and higher order virial coefficients be logically ignored in order to simplify the virial equation. The results show, wh...
متن کاملPrivacy-Utility Trade-Off for Time-Series with Application to Smart-Meter Data
We consider the online setting where a user would like to continuously release a time-series of data that is correlated with his private data, to a service provider in the hope of deriving some utility. Due to correlations, the continual observation of the released time-series puts the user at risk of inference of his private data by an adversary. To protect the user from inference attacks on h...
متن کاملOn Privacy in Time Series Data Mining
Traditional research on preserving privacy in data mining focuses on time-invariant privacy issues. With the emergence of time series data mining, traditional snapshot-based privacy issues need to be extended to be multi-dimensional with the addition of time dimension. We find current techniques to preserve privacy in data mining is not effective in preserving time-domain privacy. We present da...
متن کاملA new class of attacks on time series data mining\m{1}
Traditional research on preserving privacy in data mining focuses on time-invariant privacy issues. With the emergence of time series data mining, traditional snapshot-based privacy issues need to be extended to be multi-dimensional with the addition of time dimension. We find current techniques to preserve privacy in data mining are not effective in preserving time-domain privacy. We present t...
متن کامل